Approximating optimal solution structure with edit distance and its applications
نویسنده
چکیده
An alternative notion of approximation arising in cognitive psychology, bioinformatics and linguistics is that of computing a solution which is structurally close to an optimal one. That is, an approximate solution is considered good if its distance from an optimal solution is small, for a distance measure such as Hamming distance or edit distance. There has been a modicum of work on approximating solution structure according to an arbitrary solution distance function [HMvRW07,vRW12]; several papers addressed the complexity of approximating a witness to an NP problem to within bounded Hamming distance [GHLP99,KS99,FLN00,SY13]. Most of these results are strong lower bounds. In this paper, we extend Hamming distance inapproximability results of Sheldon and Young [SY13] to a setting where a solution is considered good if it is within a given edit distance bound from some optimal solution. We show several inapproximability results for this measure, in particular extending [SY13] n/2− n inapproximability ∀ > 0 with respect to Hamming distance to edit distance for SAT and VertexCover problems. Finally, we show that these lower bounds for both edit distance and Hamming distance apply to two practical problems: Longest Common Subsequence and Weighted Sentence Alignment.
منابع مشابه
Approximating solution structure of the Weighted Sentence Alignment problem
We study the complexity of approximating solution structure of the bijective weighted sentence alignment problem of DeNero and Klein (2008). In particular, we consider the complexity of finding an alignment that has a significant overlap with an optimal alignment. We discuss ways of representing the solution for the general weighted sentence alignment as well as phrases-to-words alignment probl...
متن کاملA Computational Method for Solving Optimal Control Problems and Their Applications
In order to obtain a solution to an optimal control problem, a numerical technique based on state-control parameterization method is presented. This method can be facilitated by the computation of performance index and state equation via approximating the control and state variable as a function of time. Several numerical examples are presented to confirm the analytical findings and illus...
متن کاملComparing Stars: On Approximating Graph Edit Distance
Graph data have become ubiquitous and manipulating them based on similarity is essential for many applications. Graph edit distance is one of the most widely accepted measures to determine similarities between graphs and has extensive applications in the fields of pattern recognition, computer vision etc. Unfortunately, the problem of graph edit distance computation is NP-Hard in general. Accor...
متن کاملObliviously Approximating Sequence Distances
There are several applications for schemes which approximately nd the distance between two sequences in a way that isòblivious' of one of the sequences up until a nal sublinear number of comparisons. This paper shows how sequences can be preprocessed obliviously to give a binary string, so that a simple vector distance between two bitstrings gives an approximation to a sequence distance of inte...
متن کاملA Short Survey of Document Structure Similarity Algorithms
This paper provides a brief survey of document structural similarity algorithms, including the optimal Tree Edit Distance algorithm and various approximation algorithms. The approximation algorithms include the simple weighted tag similarity algorithm, Fourier transforms of the structure, and a new application of the shingle technique to structural similarity. We show three surprising results. ...
متن کامل